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Qh| In the context of distance oracles, a labeling algorithm computes vertex labels during pre- 

■^^ ' processing. An s, t query computes the corresponding distance from the labels of s and t only, 

^^ \ without looking at the input graph. Hub labels is a class of labels that has been extensively 

psj ■ studied. Performance of the hub label query depends on the label size. Hierarchical labels are a 

natural special kind of hub labels. These labels are related to other problems and can be com- 
puted more efficiently. This brings up a natural question of the quality of hierarchical labels. 
We show that there is a gap: optimal hierarchical labels can be polynomially bigger than the 
general hub labels. To prove this result, we give tight upper and lower bounds on the size of 



^ \ hierarchical and general labels for hypercubes. 

CN ! 1 Introduction 

> 

r~^ ■ The point-to-point shortest path problem is a fundamental optimization problem with many appli- 

ON \ cations. Dijkstra's algorithm [5] solves this problem in near-linear time [9] on directed and in linear 

time on undirected graphs |11) . but some applications require sublinear distance queries. This is 
'^ ■ possible for some graph classes if preprocessing is allowed (e.g., [HE]). Gavoille et al. introduced 

a distance labeling algorithm [7j that precomputes a label for each vertex such that the distance 
between any two vertices s and t can be computed using only their labels. A special case is hub 
labeling (HL): the label of u consists of a collection of vertices (the hubs of u) with their distances 
from u. Hub labels satisfy the cover property: for any two vertices s and t, there exists a vertex w 
on the shortest s-t path that belongs to both the label of s and the label of t. 
^_' Cohen at al. [3] give a polynomial-time 0(logn)-approximation algorithm for the smallest la- 

beling (here n denotes the number of vertices). The complexity of the algorithm, however, is fairly 
high, making it impractical for large graphs. Abraham at al. [T] introduce a class of hierarchical 
labelings (HHL) and show that HHL can be computed in 0*{nm) time, where m is the number of 
arcs. This makes preprocessing feasible for moderately large graphs, and for some problem classes 
produces labels that are sufficiently small for practical use. In particular, this leads to the fastest 
distance oracles for continental-size road networks pj. However, the algorithm of [Ij does not have 
theoretical guarantees on the size of the labels. 

HHL is a natural algorithm that is closely related to other widely studied problems, such as ver- 
tex orderings for contraction hierarchies [8] and elimination sequences for chordal graphs (e.g., |10j). 
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This provides additional motivation for studying HHL. This motivation is orthogonal the relation- 
ship of HHL to HL, which is not directly related to the above-mentioned problems. 

HHL is a special case of HL, so a natural question is how the label size is affected by restricting 
the labels to be hierarchical. In this paper we show that HHL labels can be substantially bigger 
than the general labels. Note that it is enough to show this result for a special class of graphs. We 
study hypercubes, which have a very simple structure. However, proving tight bounds for them is 
non-trivial: Some of our upper bound constructions and lower bound proofs are fairly involved. 

We obtain upper and lower bounds on the optimal size for both kinds of labels in hypercubes. 
In particular, for a hypercube of dimension d (with 2^^ vertices), we give both upper and lower 
bounds of 3 on the HHL size. For HL, we also give a simple construction producing labels of size 
2.83 , establishing a polynomial separation between the two label classes. A more sophisticated 
argument based on the primal-dual method yields (2.5 -|- 0(1))"^ upper and lower bounds on the HL 
size. Although the upper bound is non-constructive, it implies that the Cohen et al. approximation 
algorithm computes the labels of size (2.5 -|- o(l))'^. 

The paper is organized as follows. After introducing basic definitions in Section [21 we prove 
matching upper and lower bounds on the HHL size in Section [3j Section |4] gives a simple upper 
bound on the size of HL that is polynomially better than the lower bound on the size of HHL. 
Section [5] strengthens these results by proving a better lower bound and a near-matching upper 
bound on the HL size. Section [6] contains the conclusions. 

2 Preliminaries 

In this paper we consider shortest paths in an undirected graph G = {V, E), with \V\ = n, \E\ = m, 
and length i(a) > for each arc a. The length of a path P in G is the sum of its arc lengths. 
The distance query is as follows: given a source s and a target t, to find the distance dist(s,t) 
between them, i.e., the length of the shortest path Pgt between s and t in G. Often we will consider 
unweighted graphs {i = 1). 

Dijkstra's algorithm f5] solves the problem in 0(m + nlogn) [6] time in the comparison model 
and in linear time in weaker models |llj. However, for some applications, even linear time is too 
slow. For faster queries, labeling algorithms preprocess the graph and store a label with each vertex; 
the s-t distance can be computed from the labels of s and t. We study hub labelings (HL), a special 
case of the labeling method. For each vertex v dV, HL precomputes a label L{v), which contains a 
subset of vertices (hubs) and, for every hub u the distance dist(t;,n). Furthermore, the labels obey 
the cover property: for any two vertices s and t, L{s) (1 L(t) must contain at least one vertex on the 
shortest s-t path. 

For an s-t query, among all vertices u € L{s) fl L{t) we pick the one minimizing dist(s,u) + 
dist(i(,t) and return the corresponding sum. If the entries in each label are sorted by hub vertex 
ID, this can be done with a sweep over the two labels, as in mergesort. The label size ofv, \L{v)\, 
is the number of hubs in L{v). The time for an s-t query is 0{\L{s)\ + \L{t)\). 

The labeling L is the set of all labels. We define its size as ^y{\L{v)\). Cohen et al. [3j show 
how to generate in O(n^) time a labeling whose size is within a factor O(logn) of the optimum. 

Given two distinct vertices v, w, we say that v ^ w ii L{v) contains w. A labeling is hierarchical 
if ^ is a partial order. We say that this order is implied by the labeling. Labelings computed by the 
algorithm of Cohen et al. are not necessarily hierarchical. Given a total order on vertices, the rank 
function r : V ^ [1 . . .n] ranks the vertices according to the order. We will call the corresponding 



order r. 

We define a d-dimensional hypercube H = {V, E) graph as follows. Let n = 2 denote the 
number of vertices. Every vertex v has an d-bit binary ID that we will also denote by v. The bits 
are numbered from the most to the least significant one. Two vertices f , w are connected iff their 
IDs differ in exactly one bit. If i is the index of that bit, we say that {v,w) flips i. We identify 
vertices with their IDs, and v (Bw denotes exclusive or. We also sometimes view vertices as subsets 
of {1 . . . d}, with bits indicating if the corresponding element is in or out of the set. Then v (B w is 
the symmetric difference. The graph is undirected and unweighted. 

3 Tight Bounds for HHL on Hypercubes 

In this section we show that a d-dimensional hypercube has a labeling of size S'^, and this labeling 
is optimal. 

Consider the following labeling: treat vertex IDs as sets. L{v) contains all vertices whose IDs 
are subsets of that of v. It is easy to see that this is a valid hierarchical labeling. The size of the 
labeling is 



E^^ 



-i". 



i=0 
Lemma 1. A d-dimensional hypercube has an HHL of size 3 . 

Next we show we that S"* is a tight bound. Given two vertices v and w of the hypercube, 
the induced hypercube Hyy^ is the subgraph induced by the vertices that have the same bits in the 
positions where the bits of v and w are the same, and arbitrary bits in other positions. H^yu contains 
all shortest paths between v and w. For a fixed order of vertices vi,V2, ■ ■ ■ ,Vn (from least to most 
important), we define a canonical labeling as follows: w is in the label of t; iff w is the maximum 
vertex of Hy^j with respect to the vertex order. The labeling is valid because for any s,t, the 
maximum vertex of Hgt is in L{s) and L{t), and is on the s-t shortest path. The labeling is HHL 
every hubs of a vertex v have ranks greater or equal to the rand of v. The labeling is minimal 
because if w is the maximum vertex in Hy^, then w is the only vertex of Hyy, D L{w), so L{v) must 
contain w. 

Lemma 2. The size of a canonical labeling is independent of the vertex ordering. 

Proof. It is sufficient to show that any transposition of neighbors does not affect the size. Suppose 
we transpose Vi and Wj+i. Consider a vertex w. Since only the order of Vi and Wj+i changed, L{w) 
can change only if either Vi G Hy.^-^^y, or Wj+i G Hy.yj, and fj+i is the most important vertex in 
the corresponding induced hypercubes. In the former case Vi^i is removed from L(w) after the 
transposition, and in the latter case vi is added. There are no other changes to the labels. 

Consider a bijection b : H ^ H, obtained by flipping all bits of w in the positions in which 
Vi and Wj+i differ. We show that Wj+i is removed from L(w) iff Vi is added to L{b{w)). This fact 
implies the lemma. 

Suppose Wj-|_i is removed from L{w), i.e., Vi € Hy.^-^y, and before the transposition Wj+i is the 
maximum vertex in Hy-^^y,. From Vi € Hy-^-^yj it follows that Vi coincides with fj+i in the positions 
in which Uj_|_i and w coincide. Thus b doesn't flip bits in the positions in which f j+i and w coincide. 
So positions in which Vj+i and w coincide are exactly the same in which 6(i;j_|-i) and b{w) coincide. 



Moreover, in these positions all four fj+i, w, b{vi^i) and b{w) coincide. So each vertex from H^.^^^^^ 
contains in -ff6(t,.^i),6(t„) and vice versa, thus implying H^^^^yj = i?6(„.^i)fe(,u). Note that b{vi+i) = Vi, 
and therefore Hy-^-^^j = H^.^y^y Before the transposition, fj+i is the maximum vertex of Hy.^yj^ 
and therefore L{b{w)) does not contain Vi. After the transposition, Vi becomes the maximum vertex, 
so L(b{w)) contains Vi. 

This proves the if part of the claim. The proof of the only if part is similar. D 

The hierarchical labeling of size 3*^ defined above is canonical if the vertices are ordered in the 
reverse order of their IDs. Therefore we have the following theorem. 

Theorem 1. Any hierarchical labeling of a hypercube has size of at least 3*^. 

4 An 0(2.83^) HL for Hypercubes 

Next we show an HL for the hypercube of size 0(2.83 ). Combined with the results of Section [3l 
this implies that there is a polynomial (in n = 2"^) gap between hierarchical and non-hierarchical 
label sizes. 

Consider the following HL L: For every v, L{v) contains all vertices with the first ld/2\ bits 
of ID identical to those of v and the rest arbitrary, and all vertices with the last \d/2'] bits of ID 
identical to those of v and the rest arbitrary. It is easy to see that this labeling is non-hierarchical. 
For example, consider two distinct vertices v, w with the same [d/2\ first ID bits. Then v € L{w) 
and w € L(v). 

To see that the labeling is valid, fix s, t and consider a vertex u with the first [d/2\ bits equal 
to s and the last \d/2'] bits equal to t. Clearly u is in L(s) (1 L{t). The shortest path that first 
changes bits of the first half of s to those of t and then the last bits passes through u. 

The size of the labeling is 2'^ ■ (2L'^/2J + 2^*^/21 ) = 0{2¥) = 0(2.83'^). We have the following 
result. 

Theorem 2. A d- dimensional hypercube has an HL of size 0(2.83 ). 

5 Better HL Bounds 

The bound of Theorem [2] can be improved. Let OPT be the optimal hub labeling size for a d- 
dimensional hypercube. In this section we prove the following result. 

Theorem 3. OPT = (2.5 + o{l)Y 

The proof uses the primal-dual method. Following [3], we view the labeling problem as a special 
case of SET-COVER. We state the problem of finding an optimal hub labeling of a hypercube 
as an integer linear program (ILP) which is a special case of a standard ILP formulation of SET- 
COVER (see e.g. |12j). with the sets corresponding to the shortest paths in the hypercube. For 
every vertex v € {0, 1} and every subset S C {0, 1} we introduce a binary variable Xy^s- In the 
optimal solution Xy^s = 1 iff »? is the set of vertices whose labels contain v. For every unordered pair 
of vertices {i,j} C {0, 1} we introduce the following constraint: there must be a vertex v G {0, 1} 
and a subset S C {0, 1} such that v € Hij (recall that the subcube Hij consists of vertices that lie 



on the shortest paths from i to j), {i,j} ^ S, and x^^s = 1- Thus, OPT is the optimal value of the 
following integer linear program: 

min > IS"! • Xv^s subject to 

' x,,s (^ {0,1}"'' V^;G{0,1}^5C{0,1}'^ 

EsD{ij}Xv,s>l V{i,j}C{0,l}^ (1) 

veHij 

We consider the following LP-relaxation of ([T]): 



mm 



y \S\ ■ Xy^s subject to 



x„,s>o "''^ yve{o,i}\sQ{o,iy 

IlsD{i,j} Xv,s > 1 V{i,i} C {0, 1}"* 



(2) 



We denote the optimal value of Q by LOFT, and bound OPT as follows: 

Lemma 3. LOPT < OPT < 0(d) • LOPT 

Proof. The first inequality follows from the fact that ([2]) is a relaxation of ([1]). 

As ([1]) corresponds to the standard ILP-formulation of SET-COVER, and ([2]) is the standard 
LP-relaxation for it, we can use the well-known (e.g., |12) . Theorem 13.3) result: The integrality 
gap of LP-relaxation for SET-COVER is logarithmic in the number of elements we want to cover, 
which in our case is O^n"^) = 0(2^^^). This implies the second inequality. D 

Now consider the dual program to ([2]). 



max y yuj] subject to 

y{^,J}>o ^'"'^ v{i,i}c{o,i}'^ 

E {^,j}cs y{ij} <\s\ yve {0, l}^ 5 c {0, i}'^ (3) 



The dual problem is a path packing problem. The strong duality theorem implies that LOPT is 
also the optimal solution value for ([3]). 

We strengthen ([3]) by requiring that the values yujx depend only on the distance between i and 
j. Thus, we have variables yo,yi, . . . ,yd- Let N^ denote the number of vertex pairs at distance k 
from each other. Note that since y's depend only on the distance and the hypercube is symmetric, 
it is enough to add constraints only for one vertex (e.g., ); other constraints are redundant. We 
have the following linear program, which we call regular, and denote its optimal value by ROPT. 



(4) 
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V5C {O,!}"^ 



Clearly ROPT < LOPT. The following lemma that in fact the two values are the same. 
Lemma 4. ROPT > LOPT 



Proof. Intuitively, the proof shows that by averaging a solution for ([3]), we obtain a feasible solution 
for ^ with the same objective function value. 
Given a feasible solution yuji for ^, define 

~ _ l^{i,j}:dist{i,j)=k VliJ} 

From the definition, 

We need to show that y^ is a feasible solution for @. 

Consider a random mapping (^: {0, 1} — )• {0, 1} that is a composition of a mapping i i— > i ©p, 
where p E {0, 1} is a uniformly random vertex, and a uniformly random permutation of coordinates. 
Then, clearly, we have the following properties: 

• ip preserves distance; 

• (/9 is a bijection; 

• if the distance between i and j is /c, then the pair ((/^(i), ^{j)) is uniformly distributed among 
all pairs of vertices at distance k from each other. 

Let S C {0, 1} be a fixed subset of vertices. As ysijx is a feasible solution of ([3]), we have 
We define a random variable X as follows: 

^ = Yl y{^d}- 

Since ip is a. bijection and y is a feasible solution of ^, we have E^[X] < \S\. Furthermore, 

\s\>B^[x] = 'E^[ 22 y{i,j}] = '^A z2 y'fi(i),f{3)\= zZ ^■/'[y^'W.v'O')]- 

Since {ip{i),ip{j)) is uniformly distributed among all pairs of vertices at distance dist(i,j), the last 
expression is equal to Yl {i,j}cs ydist{i,j)- □ 

Combining Lemmas [3] and HI we get 

ROPT < OPT < 0{d) • ROPT. 

It remains to prove that ROPT = (2.5 + o(l)) . For < A; < d, let y^ denote the maximum feasible 
value of yfc. It is easy to see that max^ Nf^.])^ < ROPT < (d + 1) • max^ N/^y'^. Next we show that 
maxfciVfcy* = (2.5 + o(l))'^. 
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To better understand ^, consider the graphs Gk for < k < d. Vertices of Gk are the same as 
those of the hypercube, interpreted as subsets of {1, . . . , d}. Two vertices are connected by an edge 
in Gk iff there is a shortest path of length k between them that passes through C^ in the hypercube. 
This holds iff the corresponding subsets are disjoint and the cardinality of the union of the subsets 
is equal to k. 

Consider connected components of Gk- By C^ (0 < i < [A;/2j) we denote the component that 
contains sets of cardinality i (and k — i). 

If k is odd or i 7^ k/2, G\ is a bipartite graph, with the right side vertices corresponding to sets 
of cardinality i, and the left side vertices - to sets of cardinality k — i. The number of these vertices 
is (J and (^,_j), respectively. C^ is a regular bipartite graph with vertex degree on the right side 
equal to (^Z*) : given a subset of i vertices, this is the number of ways to choose a disjoint subset of 
size k — i. The density of C^ is 

(^) + (/J ■ 

If k is even and i = k/2, then G\ is a graph with (J vertices corresponding to the subsets of 
size i. The graph is regular, with the degree (^1^)- The density of C^ written to be consistent with 
the previous case is again 

( •) + L\) ■ 

Next we prove a lemma about regular graphs, which may be of independent interest. 

Lemma 5. In a regular graph, density of any subgraph does not exceed the density of the graph. In 
a regular bipartite graph (i.e., degrees of each part are uniform), the density of any subgraph does 
not exceed the density of the graph. 

Proof. Let x be the degree of a regular graph. The density is a half of the average degree, and the 
average degree of any subgraph is at most x, so the lemma follows. 

Now consider a bipartite graph with X vertices on the left side and Y vertices of the right side. 
Consider a subgraph with X' vertices on the left and Y' vertices on the right. Assume X/X' > Y/Y'; 
the other case is symmetric. 

Let X be the degree of the vertices on the left size, then the graph density is x ■ X/{X + Y). For 

the subgraph, the number of edges adjacent to X' is at most x ■ X' , so the subgraph density is at 

most 

x-X' _ x-X x-X _ x-X 

X' + Y' ~ X + Y'X/X' - X + Y'Y/Y' ~ X + Y' 

D 

By the lemma, each G^ is the densest subgraph of itself, and since G^ are connected components 
of Gk, the densest C^ is the densest subgraph of Gk- 

Next we prove a lemma that gives the (inverse of) the value of maximum density of a subgraph 
ofGk. 

Lemma 6. For fixed d and k, the minimum of the expression 

\x) + \k-x) 
/d\ _ (d—x\ 
\x) \k—x) 



is achieved for x = [A;/2j and x = [A;/2] (with the two values being equal). 

Proof. Using the standard identity 

d\ fd-x\_/ d \ fd-k + x 
x) \k — xj \k — xj \ X 

we write the expression in the lemma as 

1 1 {d-k)l{k-x)\ {d-ky.x\ 



+ ...,.. =- 7T^^^^^ + 



(fc-x) C^'x^'') (d-xy. ' {d-k + xy.' 

Since d — k is a constant, we need to minimize 

1 1 



(k-x + l) ■ ...■ {d-x) (x + l) ■ ... ■ [d-k + x] 
Note that the expression is symmetric around x = k/2: for y = k — x, the expression becomes 



(5) 



1 1 

+ 



{y + l)-...-{d-k + y) {k-y+l)-...-{d-yy 

So it is enough to show that for x > \k/2'] , the minimum is achieved at x = [fe/2] . 
We will need the following auxiliary lemma. 

Lemma 7. If < s <t and a > (3 > I, then at + s//3 > t + s. 

Proof Since 2 <a + l/a <a + 1//3, we have a - 1 > 1 - 1//3. Thus, (a - l)t > s(l - 1//3), and 
the lemma follows. D 

It is clear that for every x the first term of ([5]) is not less than the second one. If we move from 
X to X + 1, then the first term is multiplied by (d — x)/{k — x), and the second term is divided by 

{d-k + x + l)/(x + 1). Since 

d-x d-k + x + l _{d-k){2x + l-k) 
k — x x + l (x + l)(A; — x) ~ 

we can invoke Lemma [7] with t and s being equal to the first and the second term of ^, respectively, 
a = {d - x){k - x), P = {d - k + X + l)/(x + 1). D 

Recall that y^ denotes the maximum feasible value of ijk ■ 

Lemma 8. 



Vk 



1 k = 

2/(^^7^) k = 2i,i>0 

,((') + (m))/((')-(m)) ^ = 2^ + 1- 



Proof Fix k and consider the maximum density subgraph of G^- Inverse of the subgraph density 
is an upper bound on a feasible value of y^. 

On the other hand, it is clear that we can set y^ to the inverse density of the densest subgraph 
of Gfc and other y's to zero, and obtain the feasible solution of (jj]). 

By applying Lemma [6l we obtain the desired statement. D 



Recall that N^ denotes the number of vertex pairs at distance k from each other. For each vertex 
V, we can choose a subset of k bit positions and flip bits in these positions, obtaining a vertex at 
distance k from v. This counts the ordered pairs, we need to divide by two to get the the number 
of unordered pairs: 

except for the case k = 0, where A^o = 2 . 

Finally, we need to find the maximum value of 



^Pik) := Nk-rk = 2" 



/ d\ ifd—i 
\2 



•)/(V) k = 2i 



\2i+l) 



) + (m) / 2-(?)-U ^ = 2. + l. 



One can easily see that il){2i + l)/ip{2i) = {d + l)/(4i + 2). So, if we restrict our attention to 
the case k = 2i, we could potentially lose only polynomial factors. 

We have 

■0(2^ + 2) _ d-i 

V'(2i) ~ Ai + 2' 

This expression is greater than one if i < (d — 2)/5. The optimal i has to be as close as possible to 
the bound. As d — ^ oo, this is | • (1 + o(l)). 

We will use the standard fact: if for n — > cxo, m/n — > a, then 



(2^(°)+o(i)r, 



where H is the Shannon entropy function H{e) = — elog2 e — (1 — e) log2(l — e). 
Thus, if d — )• oo, k/d -^ 2/5, then 

One can verify that 

2l+H(0A)~0.8-H{0.25) _ 2 5 

SO we have the desired result. 

6 Concluding Remarks 

We show a polynomial gap between the sizes of HL and HHL for hypercubes. Although our existence 
proof for (2.5 + o(l)) -size HL is non-constructive, the approximation algorithm of [3] can build 
such labels in polynomial time. However, it is unclear what these labels look like. It would be 
interesting to have an explicit construction of such labels. 

Little is known about the problem of computing the smallest HHL. We do not know if the prob- 
lem is NP-hard, and we know no polynomial-time algorithm for it (exact or polylog-approximate) . 
These are interesting open problems. 

The HL vs. HHL separation we show does not mean that HHL labels are substantially bigger 
than the HL ones for any graphs. In particular, experiments suggest that HHL works well for road 
networks. It would be interesting to characterize the class of networks for which HHL works well. 
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Note that an arbitrary (non-hub) labeUngs for the hypercube can be smaU: we can compute the 
distances from the standard d-bit vertex IDs. It would be interesting to show the gap between HL 
and HHL for graph classes for which arbitrary labelings must be big. 

We believe that one can prove an 0*(n^'^) bound for HL size on constant degree random graphs 
using the primal-dual method. However, for this graphs it is unclear how to prove tight bounds on 
the size of HHL. 
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